Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning
نویسندگان
چکیده
Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings each patient, challenging and time consuming task. Hence, it would be beneficial develop an automated decision tool optimize treatment. We present DeepVent, Conservative Q-Learning (CQL) based offline Deep Reinforcement Learning (DRL) agent that learns predict the optimal parameters patient promote 90 day survival. design clinically relevant intermediate reward encourages continuous improvement vitals as well addresses challenge sparse in RL. find DeepVent recommends within safe ranges, outlined recent clinical trials. The CQL algorithm offers additional safety by mitigating overestimation value estimates out-of-distribution states/actions. evaluate our using Fitted Q Evaluation (FQE) demonstrate outperforms physicians from MIMIC-III dataset.
منابع مشابه
Towards Deep Symbolic Reinforcement Learning
Deep reinforcement learning (DRL) brings the power of deep neural networks to bear on the generic task of trial-and-error learning, and its effectiveness has been convincingly demonstrated on tasks such as Atari video games and the game of Go. However, contemporary DRL systems inherit a number of shortcomings from the current generation of deep learning techniques. For example, they require ver...
متن کاملDeep Reinforcement Learning for Sepsis Treatment
Sepsis is a leading cause of mortality in intensive care units and costs hospitals billions annually. Treating a septic patient is highly challenging, because individual patients respond very differently to medical interventions and there is no universally agreed-upon treatment for sepsis. In this work, we propose an approach to deduce treatment policies for septic patients by using continuous ...
متن کاملSafe Autonomous Reinforcement Learning
In the thesis we propose, we focus on equipping existing Reinforcement Learning algorithms with different kinds of safety constraints imposed on the exploration scheme. Common Reinforcement Learning algorithms are (sometimes implicitly) assumed to work in an ergodic1, or even “restartable” environment. However, these conditions are not achievable in field robotics, where the expensive robots ca...
متن کاملTrial without Error: Towards Safe Reinforcement Learning via Human Intervention
AI systems are increasingly applied to complex tasks that involve interaction with humans. During training, such systems are potentially dangerous, as they haven’t yet learned to avoid actions that could cause serious harm. How can an AI system explore and learn without making a single mistake that harms humans or otherwise causes serious damage? For model-free reinforcement learning, having a ...
متن کاملAutonomous Quadrotor Landing using Deep Reinforcement Learning
Landing an unmanned aerial vehicle (UAV) on a ground marker is an open problem despite the effort of the research community. Previous attempts mostly focused on the analysis of hand-crafted geometric features and the use of external sensors in order to allow the vehicle to approach the land-pad. In this article, we propose a method based on deep reinforcement learning that only requires low-res...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i13.26862